A Hypergraph-Based Workload Partitioning Strategy for Parallel Data Aggregation
نویسندگان
چکیده
This paper presents an algorithm to e ciently carry out data aggregation operations on large disk-based datasets on a parallel machine. This algorithm employs a hypergraph formulation for partitioning the workload among processors. Data aggregation is a common operation executed by applications that explore and analyze very large multi-dimensional scienti c datasets. A data element in these datasets is associated with coordinates in a multi-dimensional attribute space. Depending on the application area, the datasets may include data produced by scienti c simulations and experimental measurements obtained from sensors attached to satellites and microscopes, and from seismic data and tomographic imaging techniques. In many cases, analysis is performed by (1) selecting a subset of the dataset, (2) mapping input data elements to output data elements, and (3) computing or updating the values of an output element by aggregating the values of the input elements that map to the same output element. Mapping between input and output elements is often based on the coordinates in the underlying attribute space(s). Note that an input element may map to multiple output elements, and an output element may be mapped to by multiple input elements. Typical aggre-
منابع مشابه
Image-Space-Parallel Direct Volume Rendering on a Cluster of PCs
An image-space-parallel, ray-casting-based direct volume rendering algorithm is developed for rendering of unstructured data grids on distributed-memory parallel architectures. For efficiency in screen workload calculations, a graph-partitioning-based tetrahedral cell clustering technique is used. The main contribution of the work is at the proposed model, which formulates the screen partitioni...
متن کاملPartitioning, Ordering, and Load Balancing in a Hierarchically Parallel Hybrid Linear Solver
Abstract PDSLin is a general-purpose algebraic parallel hybrid (direct/iterative) linear solver based on the Schur complement method. The most challenging step of the solver is the computation of a preconditioner based on an approximate global Schur complement. We investigate two combinatorial problems to enhance PDSLin’s performance at this step. The first is a multiconstraint partitioning pro...
متن کاملHypergraph-Partitioning-Based Decomposition for Parallel Sparse-Matrix Vector Multiplication
ÐIn this work, we show that the standard graph-partitioning-based decomposition of sparse matrices does not reflect the actual communication volume requirement for parallel matrix-vector multiplication. We propose two computational hypergraph models which avoid this crucial deficiency of the graph model. The proposed models reduce the decomposition problem to the well-known hypergraph partition...
متن کاملRevisiting Hypergraph Models for Sparse Matrix Partitioning
We provide an exposition of hypergraph models for parallelizing sparse matrix-vector multiplies. Our aim is to emphasize the expressive power of hypergraph models. First, we set forth an elementary hypergraph model for parallel matrix-vector multiply based on one-dimensional (1D) matrix partitioning. In the elementary model, the vertices represent the data of a matrix-vector multiply, and the n...
متن کاملSimultaneous Input and Output Matrix Partitioning for Outer-Product-Parallel Sparse Matrix-Matrix Multiplication
For outer-product–parallel sparse matrix-matrix multiplication (SpGEMM) of the form C=A×B, we propose three hypergraph models that achieve simultaneous partitioning of input and output matrices without any replication of input data. All three hypergraph models perform conformable one-dimensional (1D) columnwise and 1D rowwise partitioning of the input matrices A and B, respectively. The first h...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001